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INTRODUCTION 


The Am9511 APU is a complete, high performance, complex 
arithmetic processor contained within a single chip. It is de¬ 
signed to enhance the number manipulation capability of a 
wide variety of processor systems. It includes not only 
floating-point operations but fixed-point as well; not only basic 
add, subtract, multiply and divide operations, but a group of 
transcendental derived functions plus control and conversion 
commands as well. This Application Brief provides detailed 
descriptions of all the commands that can be executed by the 
Am9511 and indicates the error performance of the derived 
functions. 


The Am9511 is packaged in a standard, 24 pin, dual in-line 
package with .6 inch between rows. Figure 1 shows the pack¬ 
age pin assignments. Details on the operation of each inter¬ 
face pin will be found in the data sheet. 


The block diagram in Figure 2 shows the internal structure of 
th^APU. The part is addressed as two ports selected by the 
C/D control line. When C/D is high (Control Port), a read op- 
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Figure 2. Arithmetic Processing Unit Block Diagram. 
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eration accesses the statu^register and a write operation en¬ 
ters a command. When C/D is low (Data Port), a read opera¬ 
tion accesses data from the top of the data stack and a write 
operation enters data into the top of the data stack. 

Data Formats 

The APU executes both 16- and 32-bit fixed-point operations. 
All fixed-point operands and results are represented as binary 
two’s complement integer values. The 16-bit format can ex¬ 
press numbers with a range of -32,768 to -1-32,767. The 
32-bit format can express numbers with a range of 
-2,147,483,648 to -H2,147,483,647. 

The floating-point format uses a 32-bit word with fields as 
shown in Figure 3. The most significant bit (bit 31) indicates 
the sign of the mantissa. The next seven bits form the expo¬ 
nent and the remaining 24 bits form the mantissa value. 

The exponent of the base 2 is an unbiased two’s complement 
number with a range of -64 to -f-63. The mantissa is a 
sign-magnitude number with an assumed binary point just to 
the left of the most significant mantissa bit (bit 23). All 
floating-point values must be normalized, which makes bit 23 
always equal to 1 except when representing a value of zero. 
The number Zero is represented with binary zeros in all 32 bit 
positions. 


1 

1-MANTISSA SIGN 

r-EXPONENT SIGN 

1^—EXPONENT ^ 1 -MANTISSA ^ 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

31 30 24 23 0 
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Figure 3. Floating Point Format. 


Status Register 

The Am9511 Status register format is shown in Figure 4. 
When the Busy bit (bit 7) is high, the APU is processing a 
previously entered command and the balance of the Status 
register should not be considered valid. When the Busy bit is 
low, the operation is complete and the other status bits are 
valid. 



1 = Carry or Borrow 

1 = Overflow 

1 = Underflow 

01 = Negative Argument 

10 = Zero Divisor 

11 = Argument too Large 

1 = Top of Stack is Zero 

1 = Top of Stack is Negative 

1 = Busy 


Error 

Field 
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Figure 4. Status Register. 


Data Stack 


Figure 5 shows the two logical organizations of the internal 
data stack. It operates as a true push-down stack or FILO 
stack. That is, the data first written in will be the data last read 
out. Within each stack entry, the least significant byte is en¬ 
tered first and retrieved last. 

Figure 6 shows a typical sequence for 32 bit operations. 6a 
represents the stack prior to entry of data. 6b shows the stack 
following entry of the LS Byte of operand C. 6c illustrates the 
stack contents following the entry of four bytes of operand C. 
When operands C, B and A are all fully entered the stack ap¬ 
pears as in 6e. If a command is then issued, to add B to A for 
example, the stack contents look like 6f where R is the result 
of B -I- A. When the first (MSB) byte of R is removed the 
stack appears as in 6g. 6h shows the stack following the 
complete retrieval or R. An even number of bytes should al¬ 
ways be transferred for any data operation. 



TOP OF STACK (TOS) 
NEXT ON STACK (NOS) 



TOP OF STACK (TOS) 
NEXT ON STACK (NOS) 
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Figure 5. Stack Configurations. 


Command Format 

Each command executed by the APU is specified by a singie 
byte with the format shown in Figure 7. Bits 0 through 4 indi¬ 
cate the operation to be performed. Bits 5 and 6 specify the 
data format. Bit 7 is used to control the Service Request inter¬ 
face line. When bit 7 is a one, the SVREQ output will go true 
when the execution of the command is complete. 
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T 

CO 

L 

TOS - ► 

^ 1 1 

TOS » 

A4 , A3 1 A2 1 A1 



1 1 1 


B4 , B3 , B2 , B1 


(a) 

1 1 1 

(e) 

C4 , C3 1 C2 , Cl 



1 1 1 


1 1 1 



-32- ► 



TOS - ► 

1 1 

TOS - ► 

R4 1 R3 1 R2 , R1 



1 1 1 


C4 , C3 1 C2 1 Cl 


(b) 

1 1 1 

(f) 

1 1 1 



1 1 1 


1_L 1 






TOS -► 

o 

CM 

o 

CO 

O 

o 

TOS -► 

R3 , R2 1 R1 1 C4 



1 1 1 


C3 1 C2 1 Cl 1 


(c) 

1 1 1 

(g) 

i 1 1 



1 1 1 


1 1 1 






TOS - ^ 

B1 1 C4 1 C3 , C2 

TOS - ► 

C4 1 C3 , C2 1 Cl 



1 1 


1 1 1 


(d) 

1 1 1 

(h) 

1 1 1 



1 1 1 


i 1 1 
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Figure 6. Stack Data Sequence Example. 



^ OPERATION ^ 

SVREQ SINGLE FIXED 

CODE 

(sr) 1 1 

1 1 1 1 


7 6 5 4 3 2 1 0 
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Figure 7. Command Format. 
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Command 

Hex Code 

Hex Code 

Execution 

Summary 

Mnemonic 

(sr = 1) 

(sr = 0) 

Cycies 


Description 

16-BiT FiXED-POiNT OPERATiONS 

SADD 

EC 

6C 

16-18 



Add TOS to NOS. Result to NOS. Pop Stack. 

SSUB 

ED 

6D 

30-32 



Subtract TOS from NOS. Result to NOS. Pop Stack. 

SMUL 

EE 

6E 

84-94 



Multiply NOS by TOS. Lower result to NOS. Pop Stack. 

SMUU 

F6 

76 

80-98 



Multiply NOS by TOS. Upper result to NOS. Pop Stack. 

SDIV 

EF 

6F 

84-94 



Divide NOS by TOS. Result to NOS. Pop Stack. 

32-BIT FIXED-POINT OPERATIONS 

DADD 

AC 

2C 

20-22 



Add TOS to NOS. Result to NOS. Pop Stack. 

DSUB 

AD 

2D 

38-40 



Subtract TOS from NOS. Result to NOS. Pop Stack. 

DMUL 

AE 

2E 

194-210 


Multiply NOS by TOS. Lower result to NOS. Pop Stack. 

DMUU 

B6 

36 

182-218 


Multiply NOS by TOS. Upper result to NOS. Pop Stack. 

DDIV 

AF 

2F 

196-210 


Divide NOS by TOS. Result to NOS. Pop Stack. 

32-BIT FLOATING-POINT PRIMARY OPERATIONS 

FADD 

90 

10 

54-368 


Add TOS to NOS. Result to NOS. Pop Stack. 

FSUB 

91 

11 

70-370 


Subtract TOS from NOS. Result to NOS. Pop Stack. 

FMUL 

92 

12 

146-168 


Multiply NOS by TOS. Result to NOS. Pop Stack. 

FDIV 

93 

13 

154-184 


Divide NOS by TOS. Result to NOS. Pop Stack. 

32-BIT FLOATING-POINT DERIVED OPERATIONS 

SORT 

81 

01 

782-870 


Square Root of TOS. Result to TOS. 

SIN 

82 

02 

3796-4808 

Sine of TOS. Result to TOS. 

COS 

83 

03 

3840-4878 

Cosine of TOS. Result to TOS. 

TAN 

84 

04 

4894-5886 

Tangent of TOS. Result to TOS. 

ASIN 

85 

05 

6230-7938 

Inverse Sine of TOS. Result to TOS. 

ACOS 

86 

06 

6304-8284 

Inverse Cosine of TOS. Result to TOS. 

ATAN 

87 

07 

4992-6536 

Inverse Tangent of TOS. Result to TOS. 

LOG 

88 

08 

4474-7132 

Common Logarithm of TOS. Result to TOS. 

LN 

89 

09 

4298-6956 

Natural Logarithm of TOS. Result to TOS. 

EXP 

8A 

OA 

3794-4878 

e raised to power in TOS. Result to TOS. 

PWR 

8B 

OB 

8290-12032 

NOS raised to power in TOS. Result to NOS. Pop Stack. 

DATA AND STACK MANIPULATION OPERATIONS 

NOP 

80 

00 

4 



No Operation. Clear or set SVREQ. 

FIXS 

9F 

IF 

90-214 ) 

1 

Convert TOS from floating point format to fixed point format. 



|> 

FIXD 

9E 

IE 

90-336 J 

1 

FITS 

9D 

ID 

62-156 1 

1 

Convert TOS from fixed point format to floating point format. 



[ 

FLTD 

9C 

1C 

56-342 J 

1 

CHSS 

F4 

74 

22-24 

1 

[ 

Change sign of fixed point operand on TOS. 

CHSD 

B4 

34 

26-28 

J 

1 

CHSF 

95 

15 

16-20 



Change sign of floating point operand on TOS. 

PTOS 

F7 

77 

16] 

1 



PTOD 

B7 

37 

20 



Push stack. Duplicate NOS in TOS. 

PTOF 

97 

17 

20 J 

1 



POPS 

F8 

78 

101 

1 



POPD 

B8 

38 




Pop stack. Old NOS becomes new TOS. Old TOS rotates to bottom. 

POPF 

98 

18 

12 ] 

1 



XCHS 

F9 

79 

18' 

1 



XCHD 

B9 

39 

26 



Exchange TOS and NOS. 

XCHF 

99 

19 

26 ' 

1 



PUPI 

9A 

1A 

16 



Push floating point constant n onto TOS. Previous TOS becomes NOS. 


Figure 8. 
4 





































ALGORITHM DISCUSSION 

Computer approximations of transcendental functions are 
often based on some form of polynomial equation, such as: 

F(X) = Aq + A-iX + A2X^ + A3X^ + A4X'^ .... ("I’l) 

The primary shortcoming of an approximation in this form is 
that it typically exhibits very large errors when the magnitude 
of I X I is large, although the errors are small when I X I is 
small. With polynomials in this form, the error distribution is 
markedly uneven over any arbitrary interval. 

Fortunately, a set of approximating functions exists that not 
only minimizes the maximum error but also provides an even 
distribution of errors within the selected data representation in¬ 
terval. These are known as Chebyshev Polynomials and are 
based upon cosine functions.^These functions are defined 
as follows: 

Tn(X) = Cos n 0 ; where n = 0,1,2 .. . (1-2) 

d = Cos~^X 

The various terms of the Chebyshev series can be computed 
as shown below: 

To(X) = Cos (0*0) = Cos (0) = 1 (1-4) 

Ti(X) = Cos (Cos“’X) = X (1-5) 

T 2 (X) = Cos 26 = 2Cos^ fl - 1 = 2 Cos2 (Cos“’X) - 1 (1-6) 
= 2X^-1 

In general, the next term in the Chebyshev series can be re¬ 
cursively derived from the previous term as follows: 


Tn(X) = 2X [Tn-1(X)] - Tn-2 (X); ns 2 (1-7) 

The terms T 3 , T 4 , T 5 and Te are given below for reference: 

T 3 = 4X3 - 3X (1-8) 

T 4 = ex'* - 8X3 -I- 1 gj 

T 5 = 16X® - 20X3 4 . 5 x 
Te = 32X® - 48X'' -I- 18X3 - 1 

Chebyshev polynomials can be directly substituted for corre¬ 
sponding terms of a power series expansion by simple alge¬ 
braic manipulation: 

1 = To (1-12) 

X = Ti (1-13) 

X3 = 1/2 (To -I- T 2 ) (1-14) 

X3 = 1/4 (3T, -t- T 3 ) (1-15) 

X" = 1/8 (3To-H 4 T 2 4-T 4 ) (1-16) 

X® = 1/16 (lOT, -I- 5 T 3 + Te) (1-17) 

X® = 1/32 (IOT 0 -I- I5T2 + 6 T 4 -I- Te) (1-18) 


Each of the derived functions except square root implemented 
in the Am9511 APU has been reduced to Chebyshev poly¬ 
nomial form. A sufficient number of terms has been used to 
provide a mean relative error of about one part in 10 ^. 

Each of the functions is implemented as a three-step process. 
The first step involves range reduction. That is, the input ar¬ 
gument to the function is transformed to fall within a range of 
values for which the function can compute a valid result. For 
example, since functions like sine and cosine are periodic for 
multiples of ttI2 radians, input arguments for these functions 
are converted to lie within the range of -ttI2 to -i-7r/2. Pro¬ 
cessing of the range-reduced input argument according to the 
appropriate Chebyshev expansion is done in the second step. 
The third step includes any necessary post processing of the 
result, such as sign correction in sine or cosine for a particular 
quadrant. Range reduction and post processing are unique to 
each of the functions, while processing the Chebyshev ex¬ 
pansion is performed by an algorithm that is common to all 
functions. 


DERIVED FUNCTION ERROR PERFORMANCE 

Since each of the derived functions is an approximation of the 
true function, results computed by the Am9511 are not always 
exact. In order to more comprehensively quantify the error 
performance of the component, the following graphs have 
been prepared. Each function has been executed with a 
statistically significant number of diverse data values, span¬ 
ning the allowable input data range, and resulting errors have 
been tabulated. Absolute errors (that is, the number of bits in 
error) have been converted to relative errors according to the 
following equation: 


Relative Error = 


Absolute Error 
True Result 


This conversion permits the error to be viewed with respect to 
the magnitude of the true result. This provides a more objec¬ 
tive measurement of error performance since it directly trans¬ 
lates to a measure of significant digits of algorithm accuracy. 

For example, if a given absolute error Is 0.001 and the true 
result is also 0 . 001 , it is clear that the relative error is equal to 
1.0 (which implies that even the first significant digit of the re¬ 
sult is wrong). However, if the same absolute error is com¬ 
puted for a true result of 10000 . 0 , then the first six significant 
digits of the result are correct ( 0 . 001/10000 = 0 . 0000001 ). 

Each of the following graphs was prepared to illustrate relative 
algorithm error as a function of input data range. Natural 
Logarithm Is the only exception; since logarithms are typically 
additive, absolute error is plotted for this function. 

Two graphs have not been included in the following figures: 
common logarithms and the power function (X^). Common 
logarithms are computed by multiplication of the natural 
logarithm by the conversion factor 0.43429448 and the error 
function is therefore the same as that for natural logarithm. 
The power function is realized by combination of natural log 
and exponential functions according to the equation: 

XY = eVLnx 


The error for the power function is a combination of that for 
the logarithm and exponential functions. Specifically, the rela¬ 
tive error for PWR is expressed as follows: 


IREpwrI = IREexpI + |X(AEln)| 


where: 


REpwR = relative error for power function 
REexp = relative error for exponential function 
AEln = absolute error for natural logarithm 
X = value of independent variable in x'*^ 


Notes: 

1. Properties of Chebyshev polynomials taken from: Applied Numer- 
ical Methods ; Carnahan, Luther, Wikes; John Wiley & Sons, Inc.; 
1969. 

2. Derived function algorithms adapted from: Algorithms for Special 
Functions (I and II); Numerische Mathematic (1963); Clenshaw, 
Miller, Woodger. 
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DATA VALUES (RADIANS) 
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SINE 



COSINE 



TANGENT 



DATA VALUES 


MOS-011 


INVERSE SINE 



DATA VALUES 


MOS-012 

INVERSE COSINE 
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DATA VALUES 


_ MOS-013 

INVERSE TANGENT 



DATA VALUES 


MOS-014 

NATURAL LOG 



DATA VALUES 


_ MOS-015 

SQUARE ROOT 




DATA VALUES 

_ MOS-016 

eX 
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COMMAND DESCRIPTIONS 

This section contains detailed descriptions of the APU com¬ 
mands. They are arranged in alphabetical order by command 
mnemonic. In the descriptions, TOS means Top Of Stack and 
NOS means Next On Stack. 

All derived functions except Square Root use Chebyshev 
polynomial approximating algorithms. This approach is used 
to help minimize the internal microprogram, to minimize the 
maximum error values and to provide a relatively even dis¬ 
tribution of errors over the data range. The basic arithmetic 
operations are used by the derived functions to compute the 
various Chebyshev terms. The basic operations may produce 
error codes in the status register as a result. 

Execution times are listed in terms of clock cycles and may 
be converted into time values by multiplying by the clock 
period used. For example, an execution time of 44 clock cy¬ 


cles when running at a 4MHz rate translates to 11 micro¬ 
seconds (44 X .25/xs = ^^|JLS). Variations in execution cycles 
reflect the data dependency of the algorithms. 

In some operations exponent overflow or underflow may be 
possible. When this occurs, the exponent returned in the re¬ 
sult will be 128 greater or smaller than its true value. 

Many of the functions use portions of the data stack as 
scratch storage during development of the results. Thus pre¬ 
vious values in those stack locations will be lost. Scratch loca¬ 
tions destroyed are listed in the command descriptions and 
shown with the crossed-out locations in the Stack Contents 
After diagram. 

Figure 8 is a summary of all the Am9511 commands. It shows 
the hex codes for each command, the mnemonic abbrevi¬ 
ation, a brief description and the execution time in clock cy¬ 
cles. The commands are grouped by functional classes. 

Figure 9 lists the command mnemonics in alphabetical order. 


ACOS 

ARCCOSINE 

LOG 

COMMON LOGARITHM 

ASIN 

ARCSINE 

LN 

NATURAL LOGARITHM 

ATAN 

ARCTANGENT 

NOP 

NO OPERATION 

CHSD 

CHANGE SIGN DOUBLE 

POPD 

POP STACK DOUBLE 

CHSF 

CHANGE SIGN FLOATING 

POPF 

POP STACK FLOATING 

CHSS 

CHANGE SIGN SINGLE 

POPS 

POP STACK SINGLE 

COS 

COSINE 

PTOD 

PUSH STACK DOUBLE 

DADD 

DOUBLE ADD 

PTOF 

PUSH STACK FLOATING 

DDIV 

DOUBLE DIVIDE 

PTOS 

PUSH STACK SINGLE 

DMUL 

DOUBLE MULTIPLY LOWER 

PUPI 

PUSH 7T 

DMUU 

DOUBLE MULTIPLY UPPER 

PWR 

POWER (X^) 

DSUB 

DOUBLE SUBTRACT 

SADD 

SINGLE ADD 

EXP 

EXPONENTIATION (e’') 

SDIV 

SINGLE DIVIDE 

FADD 

FLOATING ADD 

SIN 

SINE 

FDIV 

FLOATING DIVIDE 

SMUL 

SINGLE MULTIPLY LOWER 

FIXD 

FIX DOUBLE 

SMUU 

SINGLE MULTIPLY UPPER 

FIXS 

FIX SINGLE 

SORT 

SQUARE ROOT 

FLTD 

FLOAT DOUBLE 

SSUB 

SINGLE SUBTRACT 

FITS 

FLOAT SINGLE 

TAN 

TANGENT 

FMUL 

FLOATING MULTIPLY 

XCHD 

EXCHANGE OPERANDS DOUBLE 

FSUB 

FLOATING SUBTRACT 

XCHF 

XCHS 

EXCHANGE OPERANDS FLOATING 
EXCHANGE OPERANDS SINGLE 


Figure 9. Command Mnemonics in Alphabetical Order. 
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ACOS 

32-BIT FLOATING-POINT INVERSE COSINE 



7 

6 

5 

4 

3 

2 

1 

0 

Binary Coding: 

sr 

0 

0 

0 

0 

1 

1 

0 


Hex Coding: 86 with sr = 1 
06 with sr = 0 

Execution Time: 6304 to 8284 clock cycles 
Description: 

The 32-blt floating-point operand A at the TOS is replaced by the 
32-bit floating-point inverse cosine of A. The result R Is a value in 
radians between 0 and tt. Initial operands A, B, C and D are lost. 
ACOS will accept all input data values within the range of - 1.0 to 
■f 1. 0 . Values outside this range will return an error code of 1100 
in the status register. 

Accuracy: ACOS exhibits a maximum relative error of 2.0 x 
10 “^ over the valid input data range. 

Status Affected: Sign, Zero, Error Field 

STACK CONTENTS 


ATAN 

32-BIT FLOATING-POINT 
INVERSE TANGENT 



7 

6 

5 

4 

3 

2 

1 

0 

Binary Coding: 

5r 

0 

0 

0 

0 

1 

1 

1 


Hex Coding: 87 with sr = 1 
07 with sr = 0 


Execution Time: 4992 to 6536 clock cycles 
Description: 

The 32-blt floating-point operand A at the TOS is replaced by the 
32-bit floating-point inverse tangent of A. The result R is a value in 
radians between -7r/2 and -H7r/2. Initial operands A, C and D are 
lost. Operand B is unchanged. 

ATAN will accept all input data values that can be represented in 
the floating point format. 

Accuracy: ATAN exhibits a maximum relative error of 3.0 x 
10 “^ over the input data range. 

Status Affected: Sign, Zero 

STACK CONTENTS 



ASIN 

32-BIT FLOATING-POINT INVERSE SINE 


CHSD 

32-BIT FIXED-POINT SIGN CHANGE 



7 

6 

5 

4 

3 

2 

1 

0 

Binary Coding: 

sr 

0 

0 

0 

0 

1 

0 

1 


Hex Coding: 85 with sr = 1 
05 with sr = 0 

Execution Time: 6230 to 7938 clock cycles 
Description: 

The 32-bit floating-point operand A at the TOS is replaced by the 
32-bit floating-point inverse sine of A. The result R is a value in 
radians between -ttI2 and + 7 r/ 2 . Initial operands A, B, C and D 
are lost. 

ASIN will accept all input data values within the range of -1.0 to 
+1.0. Values outside this range will return an error code of 1 100 
In the status register. 

Accuracy: ASIN exhibits a maximum relative error of 4.0 x 
10 “^ over the valid input data range. 

Status Affected: Sign, Zero, Error Field 

STACK CONTENTS 


BEFORE 


-32 


AFTER 


-TOS 




7 

6 

5 

4 

3 

2 

1 

0 

Binary Coding: 

sr 

0 

1 

1 

0 

1 

0 

0 


Hex Coding: B4 with sr = 1 
34 with sr = 0 


Execution Time: 26 to 28 clock cycles 
Description: 

The 32-bit fixed-point two’s complement integer operand A at 
the TOS is subtracted from zero. The result R replaces A at 
the TOS. Other entries in the stack are not disturbed. 

Overflow status will be set and the TOS will be returned un¬ 
changed when A is input as the most negative value possible 
in the format since no positive equivalent exists. 

Status Affected: Sign, Zero, Error Field (overflow) 


STACK CONTENTS 


BEFORE AFTER 


A 

--TOS-^ 

R 

B 


B 

C 


C 

D 


D 

--32-► 


--32-► 
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CHSF 

32-BIT FLOATING-POINT SIGN CHANGE 



7 

6 

5 

4 

3 

2 

1 

0 

Binary Coding: 

sr 

0 

0 

1 

0 

1 

0 

1 


Hex Coding: 95 with sr = 1 
15 with sr = 0 

Execution Time: 16 to 20 clock cycles 
Description: 

The sign of the mantissa of the 32-bit floating-point operand A at 
the TOS is inverted. The result R replaces A at the TOS. Other 
stack entries are unchanged. 

If A Is input as zero (mantissa MSB = 0), no change is made. 
Status Affected: Sign, Zero 

STACK CONTENTS 


BEFORE AFTER 


A 

--TOS-► 

R 

B 


B 

C 


C 

D 


D 

1—-32-► 

!-•-32-^1 


CHSS 

16-BIT FIXED-POINT SIGN CHANGE 



7 

6 

5 

4 

3 

2 

1 

0 

Binary Coding: 

sr 

1 

1 

1 

0 

1 

0 

0 


Hex Coding: F4 with sr = 1 

74 with sr = 0 

Execution Time: 22 to 24 clock cycles 
Description: 

16-bit fixed-point two’s complement integer operand A at the TOS 
is subtracted from zero. The result R replaces A at the TOS. All 
other operands are unchanged. 

Overflow status will be set and the TOS will be returned un¬ 
changed when A is input as the most negative value possible in 
the format since no positive equivalent exists. 

Status Affected: Sign, Zero, Overflow 

STACK CONTENTS 


BEFORE AFTER 


A 

^ TOS ► 

p 

M 



B 


B 

C 


C 

D 


D 

E 


E 

F 


F 

G 


G 

H 


H 


1-^16-*- 


COS 

32-BIT FLOATING-POINT COSINE 



7 

6 

5 

4 

3 

2 

1 

0 

Binary Coding: 

sr 

0 

0 

0 

0 

0 

1 

1 


Hex Coding: 83 with sr = 1 
03 with sr = 0 

Execution Time: 3840 to 4878 clock cycles 
Description: 

The 32-bit floating-point operand A at the TOS is replaced by 
R, the 32-bit floating-point cosine of A. A is assumed to be in 
radians. Operands A, C and D are lost. B is unchanged. 

The COS function can accept any input data value that can 
be represented in the data format. All input values are range 
reduced to fall within an inten/al of -7r/2 to -l-7r/2 radians. 
Accuracy: COS exhibits a maximum relative error of 5.0 x 
10 “^ for all input data values in the range of -27r 
to -\-2tt radians. 

Status Affected: Sign, Zero 

STACK CONTENTS 


BEFORE AFTER 


A 

--TOS-► 

R 

B 


B 

C 



D 

1—-32-► 

1 

1—-32-► 


DADD 

32-BIT FIXED-POINT ADD 



7 

6 

5 

4 

3 

2 

1 

0 

Binary Coding: 

sr 

0 

1 

0 

1 

1 

0 

0 


Hex Coding: AC with sr = 1 
2 C with sr = 0 

Execution Time: 20 to 22 clock cycles 
Description: 

The 32-bit fixed-point two’s complement integer operand A at the 
TOS is added to the 32-bit fixed-point two’s complement integer 
operand B at the NOS. The result R replaces operand B and the 
Stack is moved up so that R occupies the TOS. Operand B is lost. 
Operands A, C and D are unchanged. If the addition generates a 
carry it is reported in the status register. 

If the result is too large to be represented by the data format, the 
least significant 32 bits of the result are returned and overflow 
status is reported. 

Status Affected: Sign, Zero, Carry, Error Field 

STACK CONTENTS 


BEFORE AFTER 


A 

--TOS-► 

R 

B 


C 

C 


D 

D 


A 

--32-► 

!-•-32-^ 
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DDIV 

32-BIT FIXED-POINT DIVIDE 


DMUU 

32-BIT FIXED-POINT MULTIPLY, UPPER 



7 

6 

5 

4 

3 

2 

1 

0 

Binary Coding: 

sr 

0 

1 

0 

1 

1 

1 

1 


Hex Coding: AF with sr = 1 
2F with sr = 0 

Execution Time: 196 to 210 clock cycles when A 5 ^ 0 
18 clock cycles when A = 0. 

Description: 

The 32-bit fixed-point two’s complement integer operand B at 
NOS is divided by the 32-bit fixed-point two’s complement in¬ 
teger operand A at the TOS. The 32-bit integer quotient R re¬ 
places B and the stack is moved up so that R occupies the 
TOS. No remainder is generated. Operands A and B are lost. 
Operands C and D are unchanged. 

If A is zero, R is set equal to B and the divide-by-zero error 
status will be reported. If either A or B is the most negative 
value possible in the format, R will be meaningless and the 
overflow error status will be reported. 

Status Affected: Sign, Zero, Error Field 


STACK CONTENTS 

BEFORE 


AFTER 



DMUL 



7 

6 

5 

4 

3 

2 

1 

0 

Binary Coding: 

sr 

0 

1 

1 

0 

1 

1 

0 


Hex Coding: B 6 with sr = 1 
36 with sr = 0 

Execution Time: 182 to 218 clock cycles 
Description: 

The 32-bit fixed-point two’s complement integer operand A at 
the TOS is multiplied by the 32-bit fixed-point two’s comple¬ 
ment integer operand B at the NOS. The 32-bit most signifi¬ 
cant half of the product R replaces B and the stack is moved 
up so that R occupies the TOS. The least significant half of 
the product is lost. Operands A and B are lost. Operands C 
and D are unchanged. 

If A or B was the most negative value possible in the format, 
overflow status Is set and R is meaningless. 

Status Affected: Sign, Zero, Overflow 


BEFORE 


STACK CONTENTS 


AFTER 



DSUB 

32-BIT FIXED-POINT SUBTRACT 


32-BIT FIXED-POINT MULTIPLY, LOWER 



7 

6 

5 

4 

3 

2 

1 

0 

Binary Coding: 

sr 

0 

. 1 

0 

1 

1 

1 

0 


Hex Coding: AE with sr = 1 
2E with sr = 0 

Execution Time: 194 to 210 clock cycles 
Description: 

The 32-bit fixed-point two’s complement integer operand A at the 
TOS is multiplied by the 32-bit fixed-point two’s complement in¬ 
teger operand B at the NOS. The 32-bit least significant half of the 
product R replaces B and the stack is moved up so that R oc¬ 
cupies the TOS. The most significant half of the product is lost. 
Operands A and B are lost. Operands C and D are unchanged. 
The overflow status bit is set if the discarded upper half was 
non-zero. If either A or B is the most negative value that can 
be represented in the format, that value Is returned as R and 
the overflow status is set. 

Status Affected: Sign, Zero, Overflow 



7 

6 

5 

4 

3 

2 

1 

0 

Binary Coding: 

sr 

0 

1 

0 

1 

1 

0 

1 


Hex Coding: AD with sr = 1 
2D with sr = 0 

Execution Time: 38 to 40 clock cycles 
Description: 

The 32-bit fixed-point two’s complement operand A at the 
TOS is subtracted from the 32-bit fixed-point two’s comple¬ 
ment operand B at the NOS. The difference R replaces 
operand B and the stack is moved up so that R occupies the 
TOS. Operand B is lost. Operands A, C and D are un¬ 
changed. 

If the subtraction generates a borrow it Is reported in the carry 
status bit. If A is the most negative value that can be rep¬ 
resented in the format the overflow status is set. If the result 
cannot be represented in the data format range, the overflow 
bit is set and the 32 least significant bits of the result are re¬ 
turned as R. 

Status Affected: Sign, Zero, Carry, Overflow 


BEFORE 


STACK CONTENTS 


AFTER 



BEFORE 

A 

B 

C 

D 

— 32- 


STACK CONTENTS 

AFTER 


--TOS-► 

R 

C 

D 

A 

1 1 

1—-32-► 
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PTOF 

PUSH 32-BIT 
TOS ONTO STACK 



7 

6 

5 

4 

3 

2 

1 

0 

Binary Coding: 

sr 

0 

0 

1 

0 

1 

1 

1 


Hex Coding: 97 with sr = 1 
17 with sr = 0 

Execution Time: 20 clock cycles 
Description: 

The 32-bit stack is moved down and the previous TOS is copied 
into the new TOS location. Operand D is lost. All other operand 
values are unchanged. PTOF and PTOD execute the same op¬ 
eration. 

Status Affected: Sign, Zero 


STACK CONTENTS 


BEFORE 

AFTER 

A 

--TOS-► 

A 

B 


A 

C 


B 

D 


C 

-32-► 


--32-► 


PUPI 

PUSH 32-BIT 
FLOATING-POINT IT 



7 

6 

5 

4 

3 

2 

1 

0 

Binary Coding: 

sr 

0 

0 

1 

1 

0 

1 

0 


Hex Coding: 9A with sr = 1 
1 A with sr = 0 

Execution Time: 16 clock cycles 
Description: 

The 32-bit stack is moved down so that the previous TOS oc¬ 
cupies the new NOS location. 32-bit floating-point constant tt is 
entered into the new TOS location. Operand D is lost. Operands 
A, B and C are unchanged. 

Status Affected: Sign, Zero 


STACK CONTENTS 


BEFORE 

AFTER 

A 

--TOS-► 

TT 

B 


A 

C 


B 

D 


C 

l-i-32- 

--32-► 


PTOS 

PUSH 16-BIT 
TOS ONTO STACK 



7 

6 

5 

4 

3 

2 

1 

0 

Binary Coding: 

sr 

1 

1 

1 

0 

1 

1 

1 


Hex Coding: F7 with sr = 1 
77 with sr = 0 

Execution Time: 16 clock cycles 
Description: 

The 16-bit stack is moved down and the previous TOS is copied 
into the new TOS location. Operand H is lost and all other 
operand values are unchanged. 

Status Affected: Sign, Zero 


STACK CONTENTS 

BEFORE AFTER 



-r/-\o_^ 


A 

-I Uo ^ 


B 


A 

C 


B 
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C 
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D 

F 


E 

G 


F 

H 
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1—16-»- 


-^16-*- 
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PWR 

32-BIT 

FLOATING-POINT 



7 

6 

5 

4 

3 

2 

1 

0 

Binary Coding: 

sr 

0 

0 

0 

1 

0 

1 

1 


Hex Coding: 8 B with sr = 1 
OB with sr = 0 

Execution Time: 8290 to 12032 clock cycles 
Description: 

32-bit floating-point operand B at the NOS is raised to the power 
specified by the 32-bit floating-point operand A at the TOS. The 
result R of B^ replaces B and the stack is moved up so that R 
occupies the TOS. Operands A, B, and D are lost. Operand C is 
unchanged. 

The PWR function accepts all input data values that can be 
represented in the data format for operand A and all positive 
values for operand B. If operand B is non-positive an error status 
of 0100 will be returned. The EXP and LN functions are used to 
implement PWR using the relationship B^ = EXP [A(LN B)]. 
Thus if the term [A(LN B)] is outside the range of -1 .0 x 2^^ to 
+ 1.0 X 2 '*'® an error status of 1 1 00 will be returned. Underflow and 
overflow conditions can occur. 

Accuracy: The error performance for PWR is a function of 
the LN and EXP performance as expressed by; 
[(Relative Error)pyvR|= [(Relative Error)Exp+[A(Absolute 
Error)LN| 

The maximum relative error for PWR occurs when 
A is at its maximum value while [A(LN B)] Is near 
1.0 X 2^ and the EXP error is also at its maxi¬ 
mum. For most practical applications the relative 
error for PWR will be less than 7.0 x 10“^. 

Status Affected: Sign, Zero, Error Field 


STACK CONTENTS 

BEFORE AFTER 




SADD 

16-BIT 

FIXED-POINT ADD 



7 

6 

5 

4 

3 

2 

1 

0 

Binary Coding: 

sr 

1 

1 

0 

1 

1 

0 

0 


Hex Coding: EC with sr = 1 
6 C with sr = 0 

Execution Time: 16 to 18 clock cycles 
Description: 

16-bit fixed-point two’s complement integer operand A at the 
TOS is added to 16-bit fixed-point two’s complement integer 
operand B at the NOS. The result R replaces B and the stack 
is moved up so that R occupies the TOS. Operand B is lost. 
All other operands are unchanged. 

If the addition generates a carry bit it is reported in the status 
register. If an overflow occurs it is reported in the status regis¬ 
ter and the 16 least significant bits of the result are returned. 
Status Affected: Sign, Zero, Carry, Error Field 


STACK CONTENTS 


BEFORE AFTER 


A 

T/^C' 


M 

^ 1 Uo 

R 

B 


C 

C 


D 

D 


E 

E 


F 

F 


G 

G 


H 

H 


A 

-^16-*- 

F^16 — 
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SDIV 

16-BIT 

FIXED-POINT DIVIDE 



7 

6 

5 

4 

3 

2 

1 

0 

Binary Coding: 

sr 

1 

1 

0 

1 

1 

1 

1 


Hex Coding: EF with sr = 1 
6F with sr = 0 

Execution Time: 84 to 94 clock cycles for A 0 
14 clock cycles for A = 0 

Description: 

16-bit fixed-point two’s complement integer operand B at the 
NOS is divided by 16-bit fixed-point two’s complement integer 
operand A at the TOS. The 16-bit integer quotient R replaces B 
and the stack is moved up so that R occupies the TOS. No 
remainder is generated. Operands A and B are lost. All other 
operands are unchanged. 

If A is zero, R will be set equal to B and the divide-by-zero error 
status will be reported. 

Status Affected: Sign, Zero, Error Field 


STACK CONTENTS 

BEFORE AFTER 




SIN 

32-BIT 

FLOATING-POINT SINE 



7 

6 

5 

4 

3 

2 

1 

0 

Binary Coding: 

sr 

0 

0 

0 

0 

0 

1 

0 


Hex Coding: 82 with sr = 1 
02 with sr = 0 

Execution Time: 3796 to 4808 clock cycles for lAl > 
radians 

30 clock cycles for lAl ^ 2“^^ radians 

Description: 

The 32-bit floating-point operand A at the TOS is replaced by 
R, the 32-bit floating-point sine of A. A is assumed to be in 
radians. Operands A, C and D are lost. Operand B is un¬ 
changed. 

The SIN function will accept any input data value that can be 
represented by the data format. All input values are range re¬ 
duced to fall within the interval - 77/2 to +ttI2 radians. 
Accuracy: SIN exhibits a maximum relative error of 5.0 x 
10“^ for input values in the range of -27r to -i-27r 
radians. 

Status Affected: Sign, Zero 


STACK CONTENTS 


BEFORE AFTER 


A 

--TOS-► 

R 

B 


B 

C 



D 



--32-► 


--32-- 


18 






















































SMUL 

16-BIT FIXED-POINT 
MULTIPLY, LOWER 



7 

6 

5 

4 

3 

2 

1 

0 

Binary Coding: 

sr 

1 

1 

0 

1 

1 

1 

0 


Hex Coding: EE with sr = 1 
6 E with sr = 0 

Execution Time: 84 to 94 clock cycles 
Description: 

16-bit fixed-point two’s complement integer operand A at the TOS 
is multiplied by the 16-bit fixed-point two’s complement integer 
operand B at the NOS. The 16-bit least significant half of the 
product R replaces B and the stack is moved up so that R 
occupies the TOS. The most significant half of the product Is lost. 
Operands A and B are lost. All other operands are unchanged. 
The overflow status bit is set if the discarded upper half was 
non-zero. If either A or B is the most negative value that can be 
represented in the format, that value is returned as R and the 
overflow status is set. 

Status Affected: Sign, Zero, Error Field 


STACK CONTENTS 

BEFORE AFTER 




SMUU 

16-BIT FIXED-POINT 
MULTIPLY, UPPER 



7 

6 

5 

4 

3 

2 

1 

0 

Binary Coding: 

sr 

1 

1 

1 

0 

1 

1 

0 


Hex Coding: F6 with sr = 1 

76 with sr = 0 

Execution Time: 80 to 98 clock cycles 
Description: 

16-bit fixed-point two’s complement integer operand A at the 
TOS is multiplied by the 16-bit fixed-point two’s complement 
integer operand B at the NOS. The 16-bit most significant half 
of the product R replaces B and the stack is moved up so that 
R occupies the TOS. The least significant half of the product 
is lost. Operands A and B are lost. All other operands ,are un¬ 
changed. 

If either A or B is the most negative value that can be rep¬ 
resented in. the format, that value is returned as R and the 
overflow status is set. 

Status Affected: Sign, Zero, Error Field 


STACK CONTENTS 


BEFORE AFTER 
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-m Tr^c> 

D 
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F 
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-*—16—*^ 
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SORT 

32-BIT FLOATING-POINT SQUARE ROOT 


TAN 

32-BIT FLOATING-POINT TANGENT 



7 

6 

5 

4 

3 

2 

1 

0 

Binary Coding: 

sr 

0 

0 

0 

0 

0 

0 

1 


Hex Coding: 81 with sr = 1 
01 with sr = 0 

Execution Time: 782 to 870 clock cycles 
Description: 

32-bit floating-point operand A at the TOS is replaced by R, the 
32-bit floating-point square root of A. Operands A and D are lost. 
Operands B and C are not changed. 

SORT will accept any non-negative input data value that can be 
represented by the data format. If A is negative an error code of 
0100 will be returned in the status register. 

Status Affected: Sign, Zero, Error Field 


BEFORE STACK CONTENTS AFTER 


A 

--TOS-► 

R 

B 


B 

C 

r\ 


C 

U 

-32-- 


--32-► 


SSUB 


16-BIT FIXED-POINT SUBTRACT 



7 

6 

5 

4 

3 

2 

1 

0 

Binary Coding: 

sr 

1 

1 

0 

1 

1 

0 

1 


Hex Coding: ED with sr = 1 
6 D with sr = 0 

Execution Time: 30 to 32 clock cycles 
Description: 

16-bit fixed-point two’s complement integer operand A at the 
TOS is subtracted from 16-bit fixed-point two’s complement in¬ 
teger operand B at the NOS. The result R replaces B and the 
stack is moved up so that R occupies the TOS. Operand B is 
lost. All other operands are unchanged. 

If the subtraction generates a borrow it is reported in the carry 
status bit. If A is the most negative value that can be rep¬ 
resented in the format the overflow status is set. If the result 
cannot be represented in the format range, the overflow 
status is set and the 16 least significant bits of the result are 
returned as R. 

Status Affected: Sign, Zero, Carry, Error Field 


BEFORE 


STACK CONTENTS 


AFTER 




7 

6 

5 

4 

3 

2 

1 

0 

Binary Coding: 

sr 

0 

0 

0 

0 

1 

0 

0 


Hex Coding: 84 with sr = 1 
04 with sr = 0 

Execution Time: 4894 to 5886 clock cycles for lAI > 2“^^ 
radians 

30 clock cycles for lAI ^ 2~^^ radians 

Description: 

The 32-bit floating-point operand A at the TOS is replaced by 
the 32-bit floating-point tangent of A. Operand A is assumed 
to be in radians. A, C and D are lost. B is unchanged. 

The TAN function will accept any input data value that can be 
represented in the data format. All input data values are 
range-reduced to fall within -ttM to TttM radians. TAN is un* 
bounded for input values near odd multiples of ttI2 and in 
such cases the overflow bit is set in the status register. For 
angles smaller than 2“^^ radians, TAN returns A as the tan¬ 
gent of A. 

Accuracy: TAN exhibits a maximum relative error of 5.0 x 
10 “^ for input data values in the range of -2tt to 
-F27r radians except for data values near odd mul¬ 
tiples of 7tI2. 

Status Affected: Sign, Zero, Error Field (overflow) 


BEFORE STACK CONTENTS AFTER 


A 

--TOS-► 

R 

B 

C 

B 

D 

1--32-► 


--32-► 


XCHD 

EXCHANGE 32-BIT STACK OPERANDS 



7 

6 

5 

4 

3 

2 

1 

0 

Binary Coding: 

sr 

0 

1 

1 

1 

0 

0 

1 


Hex Coding: B9 with sr = 1 
39 with sr = 0 

Execution Time: 26 clock cycles 
Description: 

32-bit operand A at the TOS and 32-bit operand B at the NOS 
are exchanged. After execution, B is at the TOS and A is at 
the NOS. All operands are unchanged. XCHD and XCHF 
execute the same operation. 

Status Affected: Sign, Zero 
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XCHF 

EXCHANGE 32-BIT 
STACK OPERANDS 



7 

6 

5 

4 

3 

2 

1 

0 

Binary Coding: 

sr 

0 

0 

1 

1 

0 

0 

1 


Hex Coding: 99 with sr = 1 
19 with sr = 0 

Execution Time: 26 clock cycles 
Description: 

32-bit operand A at the TOS and 32-bit operand B at the NOS 
are exchanged. After execution, B is at the TOS and A is at 
the NOS. All operands are unchanged. XCHD and XCHF 
execute the sarhe operation. 

Status Affected: Sign, Zero 


XCHS 

EXCHANGE 16-BIT 
STACK OPERANDS 



7 

6 

5 

4 

3 

2 

1 

0 

Binary Coding: 

sr 

1 

1 

1 

1 

0 

0 

1 


Hex Coding: F9 with sr = 1 
79 with sr = 0 


Execution Time: 18 clock cycles 
Description: 

16-bit operand A at the TOS and 16-bit operand B at the NOS 
are exchanged. After execution, B is at the TOS and A is at 
the NOS. All operand values are unchanged. 

Status Affected: Sign, Zero 
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